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Ligand Screening and Design by X-ray Cry stallography 

Technical Field of the Invention 

X-ray crystallography is useful for identifying ligands that bind target receptor 
molecules and for designing ligands with improved biological activity for the target 
receptor. 

Background of the Invention 

X-ray crystallography (crystallography) is an established, well-studied technique 
that provides what can be best described as a three-dimensional picture of what a molecule 
looks like in a crystal. Scientists have used crystallography to solve the crystal structures 
for many biologically important molecules. Many classes of biomolecules can be studied 
by cry stallography, including, but not limited to, proteins, DNA, RNA and viruses. 
Scientists have even reported the crystal structures of biomolecules that carry ligands 
within its receptors (a "ligand-receptor complex"). 

Given a "picture" of a target biomolecule or a ligand-receptor complex, scientists 
can look for pockets or receptors where biological activity can take place. Then scientists 
can experimentally or computationally design high-affinity ligands (or drugs) for the 
receptors- Computational methods have alternatively been used to screen for the binding 
of small molecules. However, these previous attempts hav e met with limited success. 
Sev eral problems plague ligand design by computational methods. Computational 
methods are based on estimates rather than exact determinations of the binding energies, 
and rely on simple calculations when compared with the complex interactions that exist 
within a biomolecule. Moreover, computational models require experimental 
confirmation which often expose the models as false positives that do not work on the real 
target. 

Moreov er, experimental high-affinity ligand design based on a "picture" of the 
ligand-receptor complex has been limited to biomolecules that already have known 
ligands. Finallv, scientists only recently reported the erystallographic study of interactions 
between organic solvents and target biomolecules. Allen et al.. J. Phys._Chem., v. 100. pp. 
2605-1 1 { 1996). However, these studies are limited to mapping solvent sites rather than 
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ligand sites. It would be desirable to directly identity potential ligands. and to obtain 
detailed information on how the ligand binds and changes in the target biomolecule. In 
addition, methods for identifying and or designing ligands which possess biological and 
; or pharmaceutical activity with respect to a given target molecule would be desireable. 

5 

Brief Summary of the Inven tion 

Crystallography can be used to screen and identify compounds that are not known 
ligands of a target biomolecule for their ability to bind the target. The method (hereinafter 
"CrystaLKAD'^ 1 "") comprises obtaining a crystal of a target biomolecule; exposing the 
10 target to one or more test samples that are potential ligands of the target; and determining 
whether a ligand/biomolecule complex is formed. I he target is exposed to potential 
ligands by various methods, including but not limited to. soaking a crystal in a solution of 
one or more potential ligands or co-crystallizing a biomolecule in the presence of one or 
more potential ligands. 

15 In a further embodiment, structural information from the ligand/receptor 

complexes found are used to design new ligands that bind tighter, bind more specifically, 
have better biological activity or have better safety profile than known ligands. 

In a preferred embodiment, libraries of "shape-diverse" compounds are used to 
allow direct identification of the ligand-receptor complex even w hen the ligand is exposed 

20 as part of a mixture. This avoids the need for time-consuming de-convolution of a hit 
from the mixture. I lere. three important steps are achieved simultaneously. The 
calculated electron density function directly reveals the binding event, identifies the bound 
compound and provides a detailed 3-D structure of the ligand-receptor complex. In one 
embodiment, once a hit is found, one could screen a number of analogs or deriv ativ es of 

25 the hit for tighter binding or better biological activity by traditional screening methods. 

Another embodiment uses the hit and information about structure of the target to dev elop 
analogs or derivatives with tighter binding or better biological activity. In yet another 
embodiment, the ligand-receptor complex is exposed to additional iterations of potential 
ligands so that two or more hits can be linked together to make a more potent ligand. 



30 
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Brief Description of the Draw ings 

Figure 1 illustrates a structure-based drug design where an initial lead compound is 
found, and then used as a scaffold to carry additional moieties that tit the subsites that 
surround a major site. 

5 Figure 2 illustrates a fragment linking approach for a biomolecule having two or 

more adjacent primary pockets. 

Figure 3 is an outline of CrystaLHAD IN1 wherein a crystal is soaked in a solution of 
various potential ligands (I r I| 0 ) and an X-ray diffraction dataset is collected and 
transformed into an electron density map which is inspected for compound binding. 

10 Figure 4 illustrates a typical compound mixture in 2-D and 3-D. The 3-D figures 

are theoretical 2Fo-Fc electron density maps that represent the "shape" of the molecules. 

f igure 5 is a primary sequence of human urokinase. 

f igure 6 illustrates how a hit was detected and identified by shape after urokinase 
was soaked in a solution containing a mixture of potential ligands. Figure 6A is the initial 
15 Fo-Fc map. f igure 6B shows how the compound binds at the active site of urokinase. 
Figure 6C illustrates the active site w ithout a bound ligand when no compound of the 
mixture has bound. 

f igure 7 illustrates a hit for urokinase soaked in a solution containing a mixture of 
potential ligands. Figure 7A is the initial Fo-Fe map. f igure 7B shows how the 
20 compound binds at the active site of urokinase. 

Figure 8 illustrates a hit for urokinase soaked in a solution containing a mixture of 
potential ligands. Figure 8 A is the initial Fo-Fe map. f igure 8B shows how the 
compound binds at the active site of urokinase. 

Figure 0 illustrates two additional hits for urokinase soaked in a solution 
25 containing a mixture of potential ligands. Figure C )A is the Fo-Fc map tor a strong ligand 
within the mixture, f igure l )B is the Fo-Fc map tor a weaker ligand within the mixture. 
1 he weaker ligand was detected only after the strong ligand was removed from the 
mixture. 

figure 10 illustrates the comparative crystal structures between a lead compound 
30 found h\ t r\ stal FAI) !M and an optimized follow-up compound. 
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Figure 1 1 illustrates hits that were identified tor VanX. 
Figure 12 illustrates a hit for urokinase. 

Figure 13 illustrates the crystal structure of compound 44 with lirmC*. 
Figure 14 illustrates the crystal structure of compound 45 with FrmC\ 

5 

Det a i 1 ed Description of the Inv ention 

CrystaLFAD IM provides an efficient screening method for identifying compounds 
that will bind to a target biomolecule. Such compounds can serve as leads or scaffolds to 
design ligands and/or drugs that have improved biological activity for the target. One 
10 must note that tighter binding ligands do not necessarily provide better biological activity 
or make a better drug, although this is the general rule. It is possible for a weaker binding 
ligand to provide better biological activity due to factors other than tight binding (e.g., 
selectivity, bioavailability). 

Crystallography has been used extensively to view receptor-ligand complexes for 
15 structure-based drug design. To view such complexes, known ligands are usually soaked 
into the target molecule crystal, followed by crystallography of the complex. Sometimes, 
it is necessary to co-crystallize the ligands with the target molecule to obtain a suitable 
crystal. 

Tntil now. crystallography has not been implemented to screen potential ligands 
20 despite the detailed structural information that it provides. Possible prejudices against 
screening compounds by crystallography include the belief that the method is too 
complicated or time consuming, that suitable crystals are difficult to obtain, that av ailable 
crystals could not tolerate soaking more than one compound (much less mixtures often or 
more compounds), that too much biomolecule would be needed, that it would be too time 
25 consuming to routinely mount crystals, and that constantly changing crystals on the x-ray 
goniometer would be too tedious. 

Howev er, currently av ailable technology has ov ercome main of these perceiv ed 
barriers. For example, at one time, molecular targets were only obtained horn natural 
sources and were sometimes unsuitable for crv stalli/ation due to natural degradation or 
30 glycosv lation. In addition, the natural concentration was often too low to obtain the 

amount of highly purified protein necessary for crv stalli/ation. With molecular biology. 



6308.1 'S. 1)1 



large amounts of protein may be expressed and purified lor crystallization. When 
necessary, the protein can even be re-engineered to provide different or better crystal 
forms. 

f urther, brilliant light sources (synchrotron radiation) and more sensitive detectors 
5 have become readily available so that the time required to collect data has been reduced 
dramatically from days to hours or even minutes. Furthermore, existing technologies 
which are less routine at this time, but may become routine soon, allow full data set 
collections in the order of seconds or even fractions of a second (e.g., Laue diffraction). J. 
Hajdu et aL Nature , v. 329, pp. 178-81 (1987). Faster computers and more automation 
10 software have greatly decreased the time required for data collection and analysis. Finally, 
the inventors have discov ered that it is possible to soak or co-crystallize mixtures of 
compounds to screen for potential ligands. Thus, as described below, crystallography is 
now a practical and feasible screening method. 

In CrystaFFAD IN1 . ligands for a target molecule having a crystalline form are 
15 identified by exposing a library of small molecules, either singly or in mixtures, to the 

target (e.g. protein, nucleic acid. etc.). Then, one obtains crystallographic data to compare 
the electron density map of the putativ e target-ligand complex w ith the electron density 
map of the target biomoleeule. The electron density map simultaneously provides direct 
evidence of ligand binding, identification of the bound ligand, and the detailed 3-D 
20 structure of the ligand-target complex. Binding may also be monitored by changes in 

individual reflections within the crystallographic diffraction pattern which are known to be 
sensitive to ligand binding at the active site. This could serve as a pre-screen but would 
not be the primary method of choice because it provides less detailed structural 
in formation. 

25 By observ ing changes in the lev el of ligand electron density or the intensity of 

certain reflections in the diffraction pattern as a function of ligand concentration either 
added to the crystal or in co-crystallization, one may also determine the binding affinities 
of ligands for biomolecules. Binding affinities may also be obtained by competition 
experiments. Here, the new compound! s) are soaked or co-crystallized with one of a 

30 series of diversely-shaped ligands of known binding affinity. If the known ligand appears 
in the electron density map. the unknow n ligands are weaker binders. Howev er, if one of 
the new compounds is found to compete tor the site, it would be the tightest binder. Bv 
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varying the concentration or identity of the known ligand, a binding constant for the 
CrystaLKAD IM hit may be estimated. 

The number of compounds screened is based upon the desired detection limit, the 
compound solubility and the amount of organic co-solvent the crystals will tolerate, Fxact 
numbers depend on each crystal. For example, for a typical crystal that tolerates 1% 
organic co-solvent, the sensitivity limit would be Kd< 1.5mM to screen 10 compounds 
simultaneously. For 20 compounds, the sensitivity limit would be Kd<().63mM. 
However, crystals that tolerate high organic co-solvents (e.g., 40%). can screen up to 50 
compounds within a detection limit of Kd< 1 .5mM. 

In the most general application of CrystaLEAD iM . the hit or lead compound is used 
to determine what compounds should be tested for biological activity in structure-based 
drug design. Then derivatives and analogs are obtained by traditional medicinal chemistry 
to find the best ligand or drug. 

Alternatively, the structural information collected in the screening process can be 
used directly to suggest analogs or deriv ativ es of the hit. This approach is illustrated when 
the activ e site is composed of one primary pocket surrounded by a variety of subsites and 
small pockets (Figure 1 ). Detailed structural information about how a compound is bound 
by the receptor is obtained simultaneously as a hit is detected. Such information is useful 
to the ordinary artisan for designing better ligands. P. Colman. Cur r. Opin. in Struct . 
Biolo gy, v. 4. pp. 868-74 ( 1 994); J. Greer et aL J. Med. Chem .. v. 37. pp. 1035-54 (1994); 
C. Verlinde et al.. Stru cture, v. 15. pp. 577-87 ( 1994). In particular, the hit identifies sites 
for analog synthesis which would permit access to the surrounding subsites and small 
pockets. This suggests the design of new compounds which better fit the activ e site. 
Furthermore, in cases where there is an existing structure-function relationship, activity 
enhancing substitution patterns may be directly transferred to the new lead scaffold at the 
3-D structural level. 

Another illustration (f igure 2) usually applies to a target that has two or more 
separate pockets that will accommodate fragments. 1 lere. the crystalline target is screened 
for ligands that occupy all of the sites either in sequence or simultaneously. Because the 
binding event is monitored bv v isualizing co-crvstal structures, the site of ligand binding 
is identified direct 1\ and there is no need for competition experiments to assure that the 
ligands indeed occupy ditterent sites on the protein. Screening separately allows for 
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ligands which bind to distinct pockets that overlap in their binding loci. Screening for the 
second in the presence of the first would detect cooperative binding at a second site. Once 
potential leads and a structure-activity relationship have been established, linkages 
between each of the sites may be designed using the detailed structural information and 
the fragment linking approach as previously described to produce novel, much more 
potent ligands. S.B. Shuker et ah. Science , v. 274. pp. 1 53 1 -34 ( 1996); C. Verlinde et al.. 
Structure, v. 1 5. pp. 577-87 ( 1 994 ). 

In a third application, the scaffold merging approach (not shown), the target active 
site is composed of two or more subsites. The crystalline protein is screened for ligand/s 
which bind via these subsites and the relative ligand binding orientation observed for 
multiple experiments. These ligands should bind by occupying one or more subsites and 
by overlaying the structures of multiple hits a core may be designed that will facilitate 
access to multiple subsites. This core would then serve as a new, novel and more potent 
lead compound which would also serve as the lead scaffold in the drug-design cycle. 

The CrystaIT:AD 1M linked-fragment approach experimentally implements the 
structure-based linked-fragment approach reported only at the computational level bv 
Verlinde et al. in J. Comput. Aided Mo l. Pes., v. 6. pp. 13 1-47 (1992). Verlinde et al. 
proposed ligand fragments based on mathematical calculations. The proposed fragments 
were then assayed for binding activity. If the fragments actually bound, their 3-D 
structures were determined by X-ray crystallography and a linker designed. By contrast. 
CrystaIT:AP IN1 concurrently detects the binding event and provides an experimentallv 
determined 3-P structure of the ligand-protein complex. The inv ention also prov ides for a 
process of determining the association constant between a target molecule and its ligand. 
The inv ention requires no special labeling of the target. Therefore, the target molecule, 
can encompass proteins, polypeptides, nucleic acids, nucleoproteins. or any other suitable 
target molecule, that is isolated from natural sources or by recombinant methods from anv 
suitable host system as developed and practiced by the ordinarv artisan. 

1 here are several advantages to crv stallographic screening. One important 
advantage is that the binding event is monitored directly so that the prohabilitv tor false 
positives is reduced to near zero. The crv stallographic data prov ide a three dimensional 
electron densitv "snap-shot" ot the ligand-receptor complex showing which compound 
binds and how it is bound. 
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The method is uniquely sensitive to structural changes in both the target and the 
ligand. Observ ing structural changes is critical in designing scaffolds which combine 
information from different ligand-target complex structures. One such example occurs 
when a protein changes structure in order to accommodate one ligand, but the structure 
5 change concurrently blocks the binding of a second ligand. Similarly, detecting structural 
changes is also important because if the primary scaffolds bind differently, it may not be 
possible to combine them into a larger scaffold. 

Since the binding event is monitored directly, CrystaLEAD IM does not require 
specially labeled samples, probes or target molecules which would be indirectly sensitive 
10 to ligand association. As long as one is able to obtain a crystal structure of the target, one 
can use CrystaLEAD™ to screen for ligands. 

If compound mixtures are suitably designed to be shape-diverse, the invention 
alleviates the need for de-convolution of libraries which are soaked as a mixture because 
the binding event is detected directly by examining the shape of the electron density at the 
15 binding site. Thus, the shape of the electron density identifies both the binding event and 
the compound identity directly. Alternatively, one can design the mixture to contain 
compounds with anomalous scattering atoms (e.g. Br. S) that can be identified by 
anomalous scattering techniques, f urther, because CrystaEEAD IM directly monitors 
binding, it is particularly well-suited for studying targets where no known ligand exist. 

20 Because the electron density function calculated in Crystal. HAD IM shows the "real 

space" of the crystal, one can focus directly on the region of interest. Thus, binding may 
be detected exclusively at the site of interest although the method is not limited to the 
active site. Binding at other sites, which complicates analysis in most binding assays, can 
be eliminated from consideration totally. 

25 Cry staff AI) ,M also provides for a method of concurrently monitoring binding at 

different locations. That is. for a target with more than one pocket, screening for a second 
site does not require screening in the presence of the first ligand. However, screening for a 
second site may be completed in the presence of the first ligand in order to discover 
cooperative ligands. 

^0 Crv staff A!) N1 is applicable for any target molecule for which a crv stal structure 

can be obtained. According to current literature, this includes anv soluble macromolecule 
with molecular weight between about 5000 and 200.000. However, this range expands 
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almost daily in response to technological advances. The method is also sensitive to a wide 
range of binding dissociation constants (<picomo!ar to molar). Using more sensitive CCD 
camera detectors, data may be collected in about < 4 hrs to about 4 hours with a rotating 
anode source. This permits the screening of thousands of compounds per detector per day. 
5 Using synchrotron sources, the number of compounds screened increase to multiple 
thousands per detector, and with Laue data collection methods and testing mixtures, 
Crv staLHAD IN1 data can be collected in a second or less, thus permitting thousands of 
compounds to be tested per day per beamline. Hence, multiple detectors or a single 
synchrotron beamline facilitates true high-throughput screening. 

10 Figure 3 outlines the invention. Crystals of the target molecule are exposed to one 

or more compounds by soaking the crystal or by co-crystallizing the target in the presence 
of one or more compounds. Then crystallographic data are collected, processed and 
converted to electron density maps which are examined for evidence of ligand binding. 
One way to detect ligand binding is to compare the structure of the original crystal w ith 

15 the structure of the exposed crystal. 

New targets may be crystallized by published conditions or by other methods well 
established in the art. Similarly, target structures may be available from databases such as 
the Protein Data Bank or could be determined by well established methodology. 
Advances in molecular biology and protein engineering expedite target crystallization 
20 while advances in data collection aid in rapid structure determination for targets of 
previously unknown structure. 

Crystals that are exposed to potential ligands by soaking require an empty 
accessible active site. Crystals with an empty active site may be obtained by various 
methods, including but not limited to: crystallization in the absence of a ligand: 

25 crystallization in the presence of ligand bound at a distal site: or crystallization in the 

presence of a non-covalent ligand that is easily diluted or exchanged from the target once 
the biomolecule crystallizes. By a novel method, the inventors have obtained crystals 
from a biomolecule by crystallizing the biomolecule in the presence of a degradable ligand 
at the active site and then degrading the ligand once a crystal is formed. Alternatively, it is 

30 possible to grow the crvstals in the presence of the compounds to be screened. Crvstals 

are allowed to equilibrate in the presence of the mixture, at w hich point the ligands bind as 
a function of their concentration and binding affinitv . 
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For the soaking method, the sensitivity of the method may be approximated by 
simple equilibria relationships beeause the coneentration of protein in the crystal may be 
calculated and the concentration of ligand is a known quantity. For example, the 
concentration of a 25.000 MW protein (urokinase) in a crystal is calculated as follows: 
5 there are 4 molecules in the orthorhombic unit cell (all angles C H)°) which has a volume of 
55 X 53 X 82 A^; using Avogadro's number, the concentration is 28 mM. Therefore, a 
mixture of compounds having a 6 mM concentration for each ligand will result in a 
calculated sensitiv ity limit of Kd < 1.5 m\l (assuming a detection limit of about 80% 
occupancy in the crystal). 

10 Soaking mixtures of compounds also raises the question of multiple occupancy 

(more than one ligand binding to the site of interest). For cases of multiple occupancy 
where the ligands are bound in different pockets (see Figure 2). resolution by 
CrystaFFAD IM is easy because the binding at the separate sites can be distinguished 
individually by the electron density maps. For the scenario where different ligands 

15 compete to occupy the same site, one may use a simple competitive inhibition model to 
calculate the requirements for such binding, f rom empirical observation, it is believed 
that crystallography can resolve situations where the occupancy of one inhibitor is 80% 
and another 20%. Therefore, a ratio of binding affinity that is greater than four would 
result in an apparent occupancy by only the higher-affinity ligand. In the unlikely case 

20 where the ratio of binding constants of tw o compounds in the mixture are less than four, 
the resulting electron density w ould be a w eighted average of the two separate densities 
and might be difficult to identify. Accordingly, it would be necessary to conduct further 
soaking experiments to de-eonvolute the mixture (e.g.. looking at each compound 
individually in separate crystals) only where the ratio of binding affinities is less than four. 

25 This would still be worthwhile and efficient because it already determines that at least two 
hits are present in the mixture. 

Compounds to be screened are formed into libraries, f or the purposes of this 
discussion, libraries are large mixtures of compounds ( 1 00- 1 0.000 * ) and may be general 
or structure-directed. A general library is random, i.e. fully diverse in si/.e. shape and 
30 functionality. A structure-directed library is aimed at a particular functional mixture or 
subsite in the active site of the target molecule (e.g.. a library where all compounds 
contain a carboxv late functionality to be directed towards a positiv e charge in the target 
active site ). 
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In a preferred embodiment, either type of library is divided into smaller groups of 
shape diverse mixtures. Hence, a mixture is defined as a subset of the library which may 
be soaked or grown into the crystal. The mixture is determined to be shape diverse by 
visual inspection of the two dimensional chemical structures or computationally by 
programs. Shape div ersity of the mixture permits a bound ligand to be identified directly 
from the resultant electron density map (see Figure 4). This eliminates the need for 
follow-up experiments to determine which compound of the mixture is a hit (bound to the 
target). 

If the test compounds are water soluble, ty pical buffers and precipitant solutions 
used in crystallization can be used to solubilize the mixtures and soak them into the 
crystal. Less water soluble compounds are dissolved individually to a final concentration 
of 2M in a suitable organic solvent. In one embodiment, they are dissolved in 100% 
DM SO and stored at 4° C. and mixed by mixing the DMSO stocks before exposure to the 
crystal. These mixtures would service most crystal systems where the conditions for 
crystal growth do not include organic reagents. The compounds would be typically 
soaked to a final DMSO concentration of 1-1 0% and allowed to equilibrate with the 
crystalline protein for a pre-determined amount of time (4-24 hrs). Under this scenario, 
each crystal is exposed to multiple compounds per soaking mixture. Some crystal growth 
conditions can include a high concentration of organic solvent (40-50%) w hich are 
typically alcohol derivatives. In this case, the compound libraries may be dissolved in the 
crystallization organic solv ent which would allow a final co-solvent concentration of 40- 
50% for the soaking experiment. Here, the number of compounds per soaking mixture 
could increase. 

After soaking, each crystal is exposed to a cryoprotectant such as 5-20% glycerol 
in the soaking mixture, mounted in a nylon loop and placed on the X-ray unit under a 
nitrogen cold stream ( 160K). The crystal studies may also be performed at room 
temperature or other suitable conditions as necessary for the stability of the crystals. 
.Automated crystal mounting and changing equipment may be used to accelerate this step 
of the process. 

Cry stallographic data are collected and processed where each reflection (spot) on 
the diffraction pattern is assigned an index (h.Lb and the intensity is measured as standard 
in the field. X-ray sources may be laboratory x-ra> generators or high brilliance 
synchrotron sources that permit diffraction data collection at very high speed. 
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(Molecular Simulations Inc.. Quanta Generating and Displaying Molecules. San Diego: 
Molecular Simulations Inc.. 1997) can automate this process. 

As the ability to measure or process diffraction intensities improves, one may not 
need to perform the comparison on electron density maps. One may detect binding by 
5 simply comparing the diffraction patterns of the exposed crystals with the unexposed 

crystals. Therefore, one needs to create an electron density map only if a binding ev ent is 
detected in this pre-screening process. 

As shown above. CrystaLEAD IN1 can be applied to any biomolecular target for 
which a crystallographic structure can be obtained. Because of its broad applicability, it is 
10 best illustrated by the examples below. 

The urokinase and VanX examples represent two scenarios for the use of 
CrystaFFAD 1 M . For urokinase, the re-engineered microUK (|aUK) crystals diffract very 
well and are of a high symmetry space group. By contrast. VanX crystals diffract more 
weakly and with lower symmetry. Thus. VanX requires greater data collection time. In 

15 addition, urokinase crystals have one molecule in the asymmetric unit, while VanX has 
six. The larger asymmetric unit requires collection of higher resolution data and makes 
map inspection more tedious. I lowever. in the case of VanX. no non-substrate mimetic 
binders were known before those discov ered by Crystal. FAD IM . Therefore. 
CrystaFFAD IM provided a novel non-peptidic lead compound to be fed into the drug- 

20 discovery cycle. For urokinase, CrystaLHAD IM provided a novel primary scaffold. 

Applicants were able to rapidly increase the potency of the primary scaffold by using 
existing SAR and crystal structures to design a higher-affinity derivative with improved 
bioavailability over known urokinase ligands. 

Howev er, these examples illustrate the preferred embodiment of the present 
25 invention, and do not limit the claims or the specification. The ordinary artisan will 

readily appreciate that changes and modifications to the specified embodiments can be 
made without departing from the scope and spirit of the invention. Finally, all citations 
herein are incorporated by reference. 
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EXAMPLES 

Example 1 Urokinase 



Lrokmase. a serine protease. ,s strongly associated with tumor cells. I Urokinase 
nates piasmmogen ,nto plasmm which, in turn, activates the matrix metailoprotea 1 
I lasmtn and the metal.oproteases degrade the extracellular matr, and promote tumo 
g^wth and metastasis. Thus, inhibit that specific^ target urokmaL mav ser Ts 

effective anti-cancer agents. ' 

Human pro-urokinase cons.sts of 41 1 amino acids (Figure 5, Verde el al Proc 
NttJAa!iScL.v.» 1(5) . pp. 4727-31 , 1<»4): Nagai e, a,..Ge^ v ^ ~ 

enzyme becomes ,wo chains connected by a single disulfide bridge (Cvs'^Cvs^. The 
A-cha.n (residues ,., 5 „ con.a.ns an ECF-.ik, domain and a kringle domain. The B 

chain re*iuiiPc KQ ,11 n : . . men 

.. . , 1K ca|a|v „ c si;rine i„ cl ,b. llio „ 

ol urok.nase rcsnl.s in an additional pro.eoly.ic cleavage a, „,e I v,'" , vs'«' „ i K , 

^^^^^ co^^^Jt^r 

e covalen, ,n„,b,,„r G, U -Oly. A r g ohlorome.hy, ke.one ohrained bv Spra ,„n" , 
fe. v. 3. pp. «,.,„, W5) . and were shinM , ,„ dilTraa ^ resQ = £ 

„ I °" S0UrCe - " 0lVeVCr - '" e P °" r dil ' ,raC "°" °<»- -s,a,s 



iillbLC rvsuil Pro Rara^onjt Stnicture 
* >f ■ ,'" 'T" K "' hun,an urok.nase »as re-e„ g ,„eered ,„ consis, „ nlv 

I Ins lorn, .„ urokmase , M , K, „ as s „ mvn ,„ hc ,.„,,,. ^ ^ ^ - - 

* m a crysl.,1 , nn company „„„ ( r> su„ ,-A„'« ,s a , also. , s. ,,„e nl Nt , 
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Preparing Vector Construct p BC-LMW-l _lK^Alal_ ^ 

Mutants of human I K were cloned into a dicistronic bacterial expression v ector 
pBCTK12. Pilot-Matias et al.. Gene. v. 128, pp. 219-25 ( 1993). The following oligo 
5 nucleotides were used to generate various UK mutants by PCR: 




The initial cloning of a low molecular weight UK, hereinafter designated LMW- 
UK (L 144 -L 4! 1 ) was performed using human UK cDNA as template and SEQ ID NOs: 1 
and 2 as primers in a standard PCR reaction. The PCR amplified DNA was gel purified 
and digested with restriction enzymes Sail and IIinc{\\\. The digested product then was 
ligated into a pBCFK12 vector prev iously cut with the same two enzymes to generate 
expression vector pBC-LMW-UK. The vector was transformed in DH5u cells (Life 
Technologies, Gaithersburg, MD). isolated and the sequence confirmed by DNA 
sequencing. The production of LMW-UK in bacteria was analyzed by SDS-PAGE and 
zymography, Granelli-Piperno et al., J. Exp. Med-- v. 148, pp. 223-34 ( 1978). which 
measures plasminogen activation by UK. That LMW-UK was expressed in coli. and 
that it was active in the zymographic assay was demonstrated by commassie blue stained 
gel. 

The success of the quick expression and detection of LMW-UK in /;. coli made it 
possible to perform mutagenesis analysis of UK in order to determine its minimum 
functional structure. One mutant having a C'ys"" to Akf replacement was made with 
SIX,) ID Nos: 2 and 3 by PC R. 1 he PC R product was cut with AviU and Himl III. and 
used to replace a . Iw'II and Him! III fragment in the pBC-l \1Y\ -I K construct. The 
resulting pBC-I AIYV-I K-.\la' ' construct was expressed in /'. coli and the product 
shown to be active in /> mograpln . 
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Cloning and Lxpressing jjUK (UK(l hg -K 4 " 4 )A" *Q % ' ) in Baculovirus 

faUK (UK amino acids Ile'^-Lys 404 that contain AUr ; C iln ( ^ ) was generated by 
PCR with the following oligonucleotide primers: 




To mutate the only glycosv lation site (Asn' ") in UK, oligonucleotide primers SEQ 
ID NOs: 4 and 6. and SLQ ID NOs: 5 and 8 were used in two PCR reactions with pBC- 
LMW-UK-Akr as the template. The two PCR products were cut with restriction enzyme 
Pvu II. ligated with T4 DNA ligase. and used as template to generate LMW-UK-A 2 ' g -Q'*°\ 
In the meantime, native UK leader sequence was fused directly to Ile Iy; by PCR with SHQ 
ID NOs; 7 and 9 using native UK cDNA as the template. 

This PCR product was used as a primer, together with SEQ ID NO: 8, in a new 
PCR reaction with LMW-UK-A 2 '' g -Q" ,n: DNA as template to generate yiYK cDNA. f^iUK 
was cut with She I and ligated to a baculov irus transfer vector pJVPIOz cut with the same 
enzyme. Vialard et al. J. Viro logy, v. 64( 1 ): pp. 37-50. ( 1990). The resulting construct. 
pJYPlOz-uUK was confirmed by standard DNA sequencing techniques. 

Construct pJVP10z-|.iUK was transfected to Sf9 cells by the calcium phosphate 
precipitation method using the BaculoCiold kit from PharMingen ( San Diego. CA). Active 
uUK activity was detected in the culture medium. Single recombinant v irus expressing 
|.iUK was plaque purified by standard methods, and large stock of the v irus w as made. 

Large scale expression of uUK was made in another line of insect cells. High-f iv e 
cells (Invitrogen. Carlsbad. CA). in suspension growing in Lxcel 405 serum-free medium 
(JRII Biosciences. LeneXa. KS) in 2 liter flasks, shaking at SO rpm. 28 C. High-l ive cells 
were grown to 2 X 10 M cells ml. recombinant ul K v irus was added at 0.1 multiplicity of 



6308. US. 1)1 



-17- 



infcction, and the culture was continued tor 3 days. The culture supernatant was harvested 
as the starting material for purification. The activity of jal. T K in the culture supernatant 
was measured by amidolysis of a chromogenie UK substrate S2444. w hich was at 6-10 
mg liter. Claeson et aL 1 laemo stasis. v. 7. p. 76 ( l c )78). 

5 

[expressing iiliK in Pichia past oris 

1 o express |al ] K in Pichia. an expression vector with a synthetic leader sequence 
was used. 1 he Pichia expression vector. pHil-D8. was constructed by modifying vector 
pHil-D2 (Invitrogen) to include a synthetic leader sequence for secretion of a recombinant 

10 protein. The leader sequence 5*- 

ATGTTCTCTCCAATTTTGTCCTTGGAAATTATTTl ^A GCTTTGGCTACTTTGCAAT 
CTGTCTTCGCT CAGCCAGTTATCTGCACTACCGTTGGTTCCGCTGCCGAGG 
GATCC-3* (SEQ ID NO: 10) encodes a PHOl secretion signal (indicated by the single 
underline) operatively linked to a pro-peptide sequence (indicated in bold) for KHX2 

15 cleavage. To construct pHil-D8, PGR was performed using pHil-Sl (Invitrogen) as 

template since this vector contains the sequence encoding PI IOK a forward primer (SFQ 
ID NO: 1 1 ) corresponding to nucleotides 509-530 of pHil-Sl and a reverse primer (SHQ 
ID NO: 12) having a nucleotide sequence which encodes the latter portion of the PHOl 
secretion signal (nucleotides 45-66 of SHQ ID NO: 10) and the pro-peptide sequence 

20 (nucleotides 67-108 of SHQ ID NO: 10). The primer sequences (obtained from Operon 
Technologies. Inc. Alameda. OA) were as follows: 



Amplification was performed under standard PGR conditions. The PGR product 
(approximately 500 bp) was gel-puritled. cut with Iilp\ and EcolU and ligated to pHil-D2 
cut with the same enzymes. The DNA was transformed into E coli 1 IB 101 cells and 
positive clones identified by restriction cn/yme digestion and sequence analysis. One 
clone having the proper sequence was designated as pllil-l)8. 
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l_iUK was eluted with a NaCI gradient from 10 m\l to 250 mM. The heparin column 
eluate of nUK (-50 ml) was applied to a benzamidine-agarose (Sigma. St. Louis, MO) 
column (40 ml) equilibrated with 10 mM Hopes buffer, pH7.5, 200 mM NaCI. The 
column was then washed with the equilibration buffer and eluted with 50 mM NaOAc. pH 
5 4.5, 500 mM NaCI. The jaCK eluate (-30 ml) was concentrated to 4 ml by ultrafiltration 
and applied to a Sephadex G-75 column (2.5x48 cm; Pharmacia w Biotech, Uppsala. 
Sweden) equilibrated with 20 mM NaOAc, pH4.5. 100 mM NaCI. The single major peak 
containing yiVK was collected and lyophilized as the final product. The purified material 
appeared on SDS-PAGE as a single major band. 

10 High-quality faUK crystals facilitated determination of its apo-three-dimensional 

structure by X-ray crystallography to 1 .OA resolution. Crystals were obtained by the 
hanging drop vapor diffusion method. Typical well solutions consisted of 0.1 5M M2SO4, 
20% polyethylene glycol MW 4000 and succinate buffer pH 4.8-6.0. On the cover slip, 2 
|il of well solution w ere mixed w ith 2 jal of protein solution and the slip sealed over the 

15 well. Crystallization occurred at approximately 18-24°C within 24 hrs. The protein 
solution contained 6 mg ml (0.214mM) jiUK in 10 mM citrate pH 4.0. 3 mM e-amino 
caproic acid p-earbethoxyphenyl ester chloride (inhibitor) with l°o DMSO co-solvent. 
The inhibitor utilized in the co-crystallization is believed to acylate the active site serine 
195 and is subsequently deacylated enzv matically. because, the 3-D X-ray structure of 

20 crystals grown in the presence of this compound show no inhibitor remaining in the 

enzyme active site. Menegatti et al.. J. KnzAme Inhi bition , v. 2. pp. 249-59 ( 1989). The 
only density present is that due to bound solvent molecules. Because yiVK will not 
crystallize in the absence of the inhibitor, the meta-stable inhibitor:! 'K complex is 
believed to be the crystallization entity. Importantly, the resultant fat'k crystals are 

25 composed of enzyme with an empty active site which is the ideal case for implementation 
of CrystaLHAD ,M . 

Crystals obtained under these conditions belong to the space group P2]2]2] with 
unit cell dimensions of a-55.l6A b- 53.00A c S2.30A and u- (C-y- 9()\ They diffract to 
beyond 1 .5 A on a rotating anode source, f urther, a 1 .OA resolution native data set has 
30 been collected at the Cornell High f nergy Synchrotron Source in Ithaca. New York. The 
crystal structure was determined by the molecular replacement method using the AMORf 
program. Nava/a. T Acta Cryst.. A50: 157-1^3 ( 1994). with the low-resolution urokinase 
structure as the search probe. Spraggon et al.. Structure, v. 3. pp. 681-691 ( 1995); PDB 
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cntry 1LMW. The structure was refined using the XPLOR program package. A. Brunger, 
X -PLOR (version 2.1) Manual . Yale University. New f laven CT (1990). 

Screening tor W eak I 3 a se s 

5 The jiUK was screened against a structure-directed library in order to find a novel 

primary scaffold which would have favorable pharmacokinetic properties. Since the 
urokinase active site is composed of one primary pocket that contains a free carboxylate 
moiety in the form of an aspartic acid (Asp ). most well-known scaffolds are strongly 
basic and contain amidine or guanidine moieties. The basic group has been found to 
10 hydrogen bond salt-link with Asp 1Kg . This can be a problem pharmacologically since 
strong bases are known to decrease oral bioavailability. Accordingly, a weakly basic 
library containing compounds that were not previously known to be urokinase binders was 
selected. 

A weak base library containing 61 compounds with pKa between about 1 and 9 
15 was located in the available chemicals directory (ACD). The library was broken down 
into 9 mixtures of about 6 to 7 shape-div erse compounds, as determined by visual 
inspection of the two dimensional chemical structure. The compound mixtures were 
screened by the method described above. Specifically, each compound was dissolv ed in 
100% DMSO to a final concentration of about 2X1 (or saturation for the less soluble). 
20 Equal volumes of each of the 6 or 7 compounds comprising the mixture w ere mixed to a 
final indiv idual compound concentration of 0.33M. Single [iVK crystals were placed in 
50ml of 27% PHG4000. 15.6mM succinate pH 5.4. 0.17M f i : S() 4 and 0.5-0. 8ml of the 
compound mixture added to give 1 to 1 .6% DMSO and 3.3 to 5.2mM final individual 
compound concentration. Cnder these conditions the sensitiv ity of the experiment is 
25 expected to detect binders with kd -1 .OmM. Crystals were allowed to equilibrate for 
about 8-24hrs. 

Data were collected on a Rigaku RTP 300 RC rotating anode source with a 
RAXIS1I or MAR image plate detector. Tvpical data consisted of 45-50 2^ oscillations 
with 2-5min exposures. Tvpical data were 70-90% complete at 2. 0-3. OA resolution with 
30 merging R-faetors of 1 3-20%. Hence, the data qualitv ranged from fair to poor due to the 
rapid data collection protocol. However, this qualitv of data was shown to be adequate for 
the detection of binders primarily due to the high qualitv of the starting model w hich had 
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been refined to 1 .5 A resolution (R=20.7% Rf re e = -5-3%). Data were processed by the 
DFNXO program package. Otvvinowski et aL Methods in Fnzym ology. 276 ( 1996), and 
the electron density maps calculated by the XPLOR package. 

Flection density maps were inspected on a Silicon Graphics INDIGQ2 workstation 
5 using the QUANTA 97 program package (Molecular Simulations Inc.. Quanta Generating 
and Displaying Molecules , San Diego: Molecular Simulations Inc., 1997). The shape of 
the density at the active site was visually identified as resulting from one (or more) of the 
compounds in the mixture indicating a positive hit or from ordered water molecules 
indicating the absence of binding. For experiments w hich resulted in a positive hit. the 

10 appropriate compound was visually moved into the electron density . The electron density 
maps were also checked for any changes in the protein structure and if observed, the 
appropriate modifications were made. Hence, after the map inspection/compound fitting 
step, the three-dimensional structure of the compound:protein complex w as know n. The 
urokinase example utilized visual movement of the compound into the density because the 

15 screening was still on a small scale. When expanded to larger scale compound screening, 
commercial programs such as the XFIT module of QUAN TA w ill facilitate automatic 
fitting of the compound to the density. 




X-00 1 42753-7 1 -9 427 1 2-64- 1 

4 5 6 



f igure 6 shows an example of a positi\e hit. The compounds screened are 
numbered 1 through 6 and the To-Tc electron densit> map at the active site is shown at 
Figure 6 A. The shape of the density identified the binder as compound 5. figure 6B 
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shows the detailed binding mode of the compound in the primary specificity pocket as 
obtained directly by interpretation of the Crystal. I- AD 1N1 electron density map. The amino 
nitrogen hydrogen bonds with the Asp lv ' carboxyl and the pyrimidyl nitrogen hydrogen 

1 1 s 

bonds with a backbone carbonyl (Gl\ " 1 ). The structure also shows that the ideal site for 
modification would be at the pyridyl methyl. 
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Another mixture of compounds (compounds 7 through 13) did not produce any 
hits. I he resulting electron density map after soaking this group did not correspond to that 
of any of the tested compounds in this mixture. Instead, they correspond to bound solvent 
molecules. See Fijzure 6C\ 
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615-45-2 
20 

Figure 7 shows another example of a positive hit. Of the seven compounds 
screened (14-20), the Fo-Fc map shown in Figure 7A indicates that compound 19 is 
bound. 1 he binding mode depicted in Figure 7B shows that the 2-amino is hydrogen 
bonding with the Aspl89 side chain and that the 8-hydroxyl is an ideal site for substitution 
in order to access the adjacent hydrophobic sub-pocket (denoted as SI p in Figu re 7B). 
Figure 8 represents another hit where compound 22, 5-aminoindole. ( Figure 8A) was 
found to bind to urokinase with the amino group hydrogen bonding with Aspl89 (F igur e 
8B). Compounds screened were compounds 21-27. 
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71026-66-9 
27 



Figure 9 shows an example where two compounds from the same mixture 
5 (compounds 28-34) were found to bind without multiple occupancy problems. In the 
initial experiment where the crystal w as soaked in the presence of the entire compound 
mixture, compound 28 was found to bind ( Figure 9A). In addition, when the weaker 
binding compound 31 was soaked individually (based upon previous structure activity 
relationships established through Cr\ staLFAD IM ) it was also found to bind ( Figure 9B). 
10 In a more typical application of the method, a library would be re-soaked in the absence of 
the tighter binder in order to detect weaker binders in the mixture, if desired. 
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I able 1 summarizes the inhibition constants tor each of the Crystaf f AD IM hits as 
determined by pyroGlu-Cily-Arg-pNA/HCl (S-2444, Chromogenix) chromogenic activity. 
Assays were completed at both pH 6.5 (0.1 M NaP0 4 ) and 7.4 (5()mM I ris). Other 
conditions of the assay were 15()m\l NaCl. 0.5% Pluronic f-68 detergent. 200mM S- 
5 2444, w ith a final DMSO concentration of 2.5%. The Km of the substrate w as determined 
to be 55|aVL 



fable 1 : Inhibition constants and pKa for hits detected by CrystaLKAD' 

Compound (CAS #) Ki Ki pKa 

(pH6.5) (pH 7.4) 

5 (42753-71-9) »500|aM »500nM 6.0* 

19 (70125-16-5) 56yiM 137|iM 7.3 

22 (65795-92-8) 200|aM >500|iM 6.0* 

28(580-22-3) 71-tM 136|iM 7.3 

31 (1603-41-4) »500|aM > oOOuM 7.0* 
10 • indicates estimated pKa 



Based upon the activity and structural information, compound 19 was chosen as 
the lead compound. Crystallographic information indicated that substitution at the 8- 
position should allow access to the adjacent hydrophobic pocket (SI ft) pocket and therebv 

15 result in an increase in potency. Based upon crystallographic and binding information 
from an amidine-based series, compound 35 was synthesized (the 8-aminopyrimidinvl 
analog of compound 19). This modification resulted in about a 200 fold increase in 
binding potency at pll 6.5 (Ki pi 17.4= 2.5(.i\l; Ki pH6.5 r: ().32|aM). The experiment 
indicates that Crystal. HAI) 1M can provide both a lead scaffold and the detailed structural 

20 information necessary to elaborate that scaffold through structure-based drug design into a 
more potent compound. 
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35 

In Figure 10, an ov erlay of the crystal compound 35:urokinase and the parent 
compound 19 are shown. The overlay shows that the aminopyridine ring is hound in the 
5 hydrophobic sub-pocket (SI P) pocket as predicted and that this substitution results in 
movement of the quinoline ring tow ards this site. 

Compound 35. the 8-aminopyrimidin\ 1-2-aminoquinoline, was also tested for oral 
bioavailability. Compound 35 was determined to be 30-40% orally bioavailable in the rat 
when administered at a lOmg/kg dose. Hence, successful implementation of 
10 CrystaLFAD IN1 resulted in a novel lead scaffold which through one cycle of structure- 
based drug design produced a compound having a 200-fold increase in potency, and was 
found to be orally bioavailable. 



Example 2: VanX 

15 

Vancomycin is the drug of choice for infections caused by streptococcal or 
staphylococcal bacterial strains that are resistant to (^-lactam antibiotics. However, strains 
of vancomycin resistant bacteria have now been found for this drug of last recourse. Some 
investigators have associated VanX, a metalloproteinase. with vancomycin resistance. 

20 VanX is part of a cascade that results in replacement of the terminal I)-Ala-I)-Ala moietv 
of the bacterial peptidoglycan chain (the binding site for vancomycin) with a D-Ala-D- 
lactate. This results in a 1000-fold decrease in vancomycin binding. The only known 
inhibitors of VanX are peptides or peptide derivatives, such as phosphonate or phosphinate 
analogs of the I)-Ala-D- Ala substrate. As such, they are not suitable drugs because the\ 

25 are metabolized and or degraded in vivo. Initial attempts to find suitable drugs by normal 
screening methods did not uncover a suitable tigand. Subsequently. .Applicants turned to 
Crystal FAI) 1M to find a non-peptide lead compound for drug development towards a 
treatment for these resistant strains. 
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VanX Prepara tion 

K. coli W3 1 10 containing plasmid pCiWl. in which the vanX gene is under control 
of the IPTG-inducible tac promoter, was grown at 37 °C in LB medium containing 
ampicillin ( 100 ng/ml) to an absorbance of about 1 .3-1 .5 at 595 nm. Then IPTG was then 
added to a final concentration of 0.8 mM. and the cells were grown for an additional 1 .5 
hours. 

Cells were harvested by centrifugation at 6000 rpm for 10 min. Then, the pellet 
was resuspended in ice cold 20 mM Tris-HCl (pH 8.0) containing 0.01% NaN> 1 mM 
MgCl 2 . ImM PMSF, 1 mM DTT (Buffer A) and 25 units/ml of benzonase (Nicomed 
Pharma. Copenhagen, Denmark). The cells were lysed by the addition of 0.1 micron 
zirconia ceramic beads to the lysate mixture (1:1 v:v) with a 1 -3 minute run in a Bead 
Beater (Biospec), an ultrasound bead mill. The Bead Beater was run with an ice-packed 
reservoir to maintain a chilled lysate. Then, the lysate was decanted away from the settled 
glass beads. The beads were then rinsed with 1-2 volumes of lysis buffer, and the washes 
were then pooled with the original lysate. The lysate was centrifuged at 25()00g for 30 
minutes to settle cell debris. The supernatant was dialyzed overnight at 4 °C in 50mM 
Tris-HCl, pH 7.6, 1 mm KDTA. and ImM DTT (Buffer B). 

Thereafter, the dialyzed lysate was loaded onto a Q-sepharose fast How column, pre- 
equilibrated in Buffer A at a rate of four millimeters per minute. The column was exhaustively 
washed with the Buffer A followed by a linear gradient of Buffer B to Buffer B+0.5 M NaCl. The 
active VanX fractions from this step were pooled, concentrated and then applied to a Superose-75 
column in Buffer B. VanX fractions from the Superose column run were then applied to a Source-Q 
column in Buffer A at a How rate of 2 ml min. The column was washed w ith starting buffer for 
several column volumes. Then the VanX protein w as eluted off w ith a shallow gradient of Buffer A 
to Buffer A-25mM NaCl. The active VanX fractions from this final step were concentrated to a 
final concentration of approximately 15 mg ml in Buffer A with Amicon filters. I'nless otherwise 
specified, the foregoing procedure was run at 4 C. As purified, the VanX protein was 
approximate^ 05° o pure and readily crystallized. 
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VanX Crystal Structure 

The crystal structure of VanX was determined at 2.2 A resolution by multiple 
isomorphous replacement. Bussiere et <//.. Molecu lar Cell , Vol. 2. pp 75-84 ( 1998). The 
recombinant protein obtained above was crystallized in the space group P2i by the sitting 
drop vapor diffusion method. Typical crystals had unit cell dimensions of a=83.4A, 
b=45.5A. c=171.4A. a=y^90°. [5=104° with six molecules in the asymmetric unit. Typical 
well solutions consist of 0.1 M Mes pH 6.4, 0.24M ammonium sulfate, and 20% PMME 
5000. On the sitting drop microbridge (Hampton USA), 2ml of protein are mixed with 
2ml of well solution and the chamber sealed with a cover slip. Crystallization occurs at 
1 8°C\ and the crystals grow to full size in about 2-3 days. The protein solution is 
composed of 12-1 5mg ml (0.5-0.6mM) VanX in lOmM Tris, 15mM DTT, pH 7.2. The 3- 
D structure for crystals grown under these conditions show an empty active site making 
this a system highly suitable for application of CrystaLKAD IM . 

The VanX active site has an extended pocket capable of accommodating the D- 
Ala-D-Ala substrate. The pocket also contains a catalytic zinc. Thus, for this case. VanX 
was initially screened against zinc directed libraries in order to find multiple binding 
scaffolds which could be merged into a single lead compound. Three libraries utilizing 
amino-acid. thiol, hydroxamic acid or earboxylate moieties directed towards zinc were 
screened. 

Screening 

1 he amino acid library consisted of 102 compounds of optically pure 
commercially available natural and non-naturally occurring amino acids. The library was 
divided into 12 mixtures of S- 10 shape-diverse compounds and screened by the method 
described above. Specifically, each compound was dissolved in 100% DMSO to a final 
concentration of 2M (or saturation for the less soluble), fqual volumes of each compound 
of each mixture were mixed to a final individual compound concentration of 0.3 3 \1. 
Single VanX crystals were placed in 50ml of 0. 1 M Mes pi 1 6.4. 0.24M ammonium 
sulfate. 20° n FMMI'. 5000 and 0.5-0. 8ml of the compound mixture added to give 1 to 
1 .6% DMSO and 3.3 to 5.2mM final individual compound concentration. Crystals were 
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allowcd to equilibrate for 3-4 hrs. The thiol, hydroxamic and carboxylate libraries were 
prepared and sereened in a similar manner. 

Data were eolleeted on a Rigaku R I P 300 RC rotating anode source with a 
RAXISII, MAR image plate, or MAR CCD detector. For the image plate systems, typical 
5 data consisted of 90 1.25° oscillations with 15min exposures while for the CCD 100 1.0° 
oscillations were exposed for two minutes. Typical usable data were >90% complete at 
2.6-2.8A resolution with merging R-factors of 10-20%. This was required to adequately 
visualize and identify inhibitors in the Fo-Fc or 2Fo-Fc maps. For these maps, the starting 
model had been refined to 2.1 A resolution (R=25% Rf re e=28%). Data were processed by 

iO the DFNZO program package and the electron density maps calculated by the XPLOR 
package. In the presence of some compounds of the carboxylate library, the space group 
was shown to shift from P2] to C2 (a=170.6A. b=47.5A. c=83.6A. a=y=90°. 0=104°). 
For this form, the asymmetric unit contained a trimer thereby reducing the number of 
degrees of freedom so that lower resolution data (3.0A) were adequate for visualization of 

15 binding. 

Electron density maps were inspected on a Silicon Graphics INDIGQ2 workstation 
using QUANTA 97. The shape of the density at the activ e site was visually identified by 
the shape of one or more of the compounds in the mixture to indicate a positive hit or by 
ordered water molecules indicating the absence of binding. For experiments w hich 
20 resulted in a positive hit. the appropriate compound was visually moved into the electron 
density. The electron density maps were also checked for any changes in the protein 
structure, and if observ ed, corresponding modifications were made in the structure. 
Hence, after the map inspection compound-fitting step, the detailed 3-D structure of the 
compound:protein complex was known. 

25 

OH ^ V OH 

580-22-3 329-89-5 132-32-1 

36 37 38 
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1 603-4 1 -4 1 3 7-09-7 243 1 3-88-0 
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5 Currently 6 hits have been detected in the VanX screens (compounds 36-41 ). 

Figure 1 1 shows the binding mode of representative hits. In all cases, the electron density 
shape identified the binding compound. Figure 1 1 A shows compound 39 bound with the 
carboxylate coordinating to the active site zinc. Figure 1 1 B. shows compound 36 bound 
with the carboxylate pointing towards the active site zinc. In Figure 1 1C\ compound 37 

10 was also found to bind through the carboxylate. The binding of compound 39 and 
compound 41 (not shown) suggests that the active site zinc prefers coordination of a 
carboxylate over a free thiol. This led to screening of a carboxylate library where 
additional hits were found. In all cases, the compounds were screened in mixtures of 7-10 
and the hit directly identified by the shape of the electron density map. These hits are fed 

15 directly into the structure-based drug design cycle in a manner similar to that described for 
the urokinase example. 



Example 3: Scre ening with Mixtures of 1 00 C omp ounds 

20 

In order to increase the number of compounds that may be screened per unit time 
by the CrystaFFAD 1N1 method, a preferred embodiment of the method would be to screen 
mixtures of 100 compounds rather than mixtures of 10. The advantage of this method is a 
higher compound throughput with a concurrent lowering of the sensitivity of the hit 
25 detection. In addition, since only the most potent compound in a mixture will bind, 
weaker hits may be missed. W hen a general library, for example, one which is fully 
diverse in si/e. shape and functionality, is screened by Crystal .FAI)' M . the hit-rate is 
expected to be low. I herefore. a more coarse screen is warranted. In addition, since the 
hits from this screen would be the more potent binders, they could serve as starting 
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scaffolds for structure-based drug design. Since the compound mixture will be composed 
of 100 compounds, the mixture should be carefully designed in order to ensure that all 
members would be diversely shaped enough to eliminate the need for deconvolution. 
Hence, upon hit detection, some deconvolution may be necessary to identify the hit. 

5 

To test this particular method, a compound known to bind to jaUk was added to a 
group of 100 compounds. This known binder, compound 19. was originally discovered 
by the CrystaLEAD IM method and shown to bind to jiUK with a Ki of 56jaM at pH 6.5 
and 137|aM at pU 7.4. The 100 compound mixture was constructed by mixing 10 

10 mixtures of 10 compounds. Specifically, each dry mixture of 10 was dissolved in 100% 
DMSO to a final concentration of about 80-240mM(or saturation for the less soluble). 
Equal volumes of each of the mixtures of 10 compounds were mixed to a final individual 
compound concentration of 8.0-24.0mM and the mixture spiked with a 100% DMSO 
stock of compound 19 such that the final concentration was I8.0mM. Single (iUK crystals 

15 were placed in 50^1 of 27% PKG4000. 15.6mM succinate pH 5.4, 0.1 7M Li : S0 4 and 

().5jaLof the compound mixture added to give 1% DMSO. The final concentration of each 
compound in the soak experiment ranged from 80-240(.iM. the concentration of compound 
19 was 180(iM. Under these conditions the sensitivity of the experiment is expected to 
detect binders with Kd<20-60|aM. Crystals were allowed to equilibrate for 4 hours and 1 5 

20 minutes. 

Data were collected at the Argonne National Labs advanced photon source 
synchrotron ID beamline IMC A equipped with a MarCCD camera. Data consisted of 100 
1 0 oscillations with 7 sec exposures. Data were 87.4% complete at 1 .6A resolution with 
an overall merging R-factor of 5.4%. Data were processed by the DHN/O program 
25 package. Otwinowski et al.. Methods in Hn/ymokuzy. 276 ( 1006). and the electron densit\ 
map calculated by the XPI.OR package. 

The electron density map was inspected on a Silicon Graphics INDl(i()2 
workstation using the QUANTA 07 program package (Molecular Simulations Inc.. Quanta 
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Generating and Displaying Molecules , San Diego: Molecular Simulations Inc.. 1997). 
The shape of the density at the active site was visually identified as resulting from one of 
the compounds in the mixture indicating a positive hit which was identified as 
compound 19. and is illustrated in Figure 12. 

5 

This method is preferable for discovering lead compounds. Fead compounds 
would typically have the characteristics of being tighter binders (for example, within the 
sensitivity range of the method). This method also allows screening of a 10,000 
compound non-directed library on the timeframe of 1-2 weeks. This method would be 
10 used in conjunction with the other methods of screening 10-20 compounds at a time where 
weaker binders would be identified. These binders would be less likely to serve as lead 
compounds, but could be attached to a lead scaffold in order to increase the potency. 

15 Example 4: CrystaFFAD IM screen i ng of FrmC ' 

FrmC* is an rRNA methyltransferase that transfers a methyl group from S- 
Adenosyl-F-methionine to N6 of adenine w ithin the peptidyltransferase loop of 23 S 
rRNA. This methylation confers antibiotic resistance against a number of macrolide 
20 antibiotics such as the widely prescribed erythromycin. Inhibition of FrmC* would be 

expected to reverse resistance. In order to design a specific and potent FrmC" inhibitor, 
the cofactor or S-Adenosyl-F-methionine binding site has been targeted. S-Adenosyl-F- 
methionine is illustrated below as compound 42: 
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NH 2 



10 42 

The crystal structure of HrmC* shows that the S-Adenosyl-L-inethionine site is 
composed of two primary pockets which accommodate the adenine ring and the 
methionine. In addition, there is a third pocket which may accommodate the rRNA 
15 adenine that undergoes methylation. In order establish an SAR at this site, a library of 
adenosine analogues substituted at N6 and/or 5' hydroxy 1 was generated- The sites of 
variation in the library is represented below as compound 43.. 



20 




R . 



43 
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I-rmC e xpression and purif ication 

The expression vector pTERM31 was constructed by polymerase chain reaction 
5 (PCR) amplification of the ermC* gene and the upstream kdsB cistron from pERM-1 . 

Suheloning the PCR product into pET24 f (Novagen. Madison, \VI) was performed using 
the Bam\\\ and ////?dIII sites included in the "tailed" PCR primers. This new construct 
allowed the expression of ErmC* by translational coupling to kdsB, under the control of 
the Tllac promoter. pTKRM31 plasmid was transformed into E. coli strain 

10 BE21 9(DE3)/pEysS (Novagen) and the resulting strain was used for production of ErmC. 
Transformed cells were grown at 27.5°C in a New Brunswick Scientific (Edison, NJ) 
Micros fermentor containing 10 1 of Superbroth (BIO 101 . Ea Jolla, CA). supplemented 
with kanamycin, chloramphenicol, and glucose. When the culture optical density reached 
1.10. ErmC* expression was induced by the addition of ImM isopropyl fi-d- 

15 thiogalactopyranoside (IPTG). Cells were harvested 400 minutes post-induction. 

Frozen cell paste (2O0-250g) was thawed at room temperature and resuspended into 5-10 
volumes of cold lysis buffer (50mM Tris. 5mM 1 ,4-dithiothreitol (DTT). ImM 
phenylmethylsulfonate fluoride (PMSF). 2mM ethvlene-diaminetetraacetic acid (EDIA), 0.2% 
Triton X-100, ph 7.8). The cells were lysed w ith a French press and cell debris removed by 

20 centrifugation. The supernatant was dialysed overnight against 20 1 Tris-DT I -glycerol-magnesium 
(TIXiM) buffer. pH 7.8 (5()mM Tris. 5mM D TT. 10% glycerol. lOmM MgC12). The dialysate was 
then applied to a Sepharose Fast Flow column (Pharmacia) that had been pre-equilibrated in TIXiM 
buffer. Fractions were assayed for methyltransferase activity and those containing ErmC* were 
pooled, applied to a TSK SP-5PW column (Tosollaas. Montgomeryville. PA), and eluted with an 

25 NaCl gradient. The purified protein was then concentrated on a YM-10 (Amicon) membrane. 

ErmC Crystal Structure 

Crystals of ErmC" were grown by the hanging drop vapor diffusion method. 
Drops containing 5-8 mgs ml ErmC* in 25mM I ris CI. IDOmM NaCl. 2mM 1) I E l()°o 
^0 (v v) glycerol, pit 7 .5 were equilibrated against a reser\ oir containing lOOmM Iris. 

500mM NH4(S())4. 15°o EECt 8000. pH 7.8. Crystals appeared within one day and grew 
to their full si/e within one week. Crystals belonged to the space group P43212. The 
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structure of HrmC* in this space group was determined by molecular replacement to 2.2 
angstrom resolution using the crystal structure of HrmC" in the space group P6 (Bussiere 
et al.. Biochemistry Vol. 37. pp 7103-71 12). The 3-D structure for crystals grown under 
these conditions show an empty active site making this a system highly suitable for 
5 application of CrystaHHAD iM . 

Screening 

The adenosine library consisted of 59 compounds. The library was divided into 7 
mixtures of 8-9 shape-diverse compounds and screened by the CrystaLF:AD IM method. 

10 Specifically, each compound was dissolved in 100% DMSO to a final concentration of 1M 
(or saturation for the less soluble). Equal volumes of each compound were mixed to 
assemble the mixture of 10. Single HrmC crystals were placed in 50|il of 20% PHG 8000. 
0.3M ammonium sulfate. 10% glycerol, pH 7.7 and 0.5-0.8(^1 of the compound mixture 
added to give 1 to 1.6% DMSO and 3.3 to 5.2jaM final individual compound 

1 5 concentration. Crystals were allowed to equilibrate for 3-4 hrs. 

Data were collected on a Rigaku R I P 300 RC rotating anode source with a 
RAXISII, MAR image plate, or MAR CCD detector, f or the image plate systems, typical 
data consisted of 1 5-20 2° oscillations with 2()-30min exposures while for the CCD 1 5-20 
2.0° oscillations were exposed for 8-15 minutes. Typical usable data were 80-90% 
20 complete at 3.4-3.6A resolution w ith merging R-factors of 7-16%. This was required to 

adequately visualize and identify inhibitors in the Fo-Fc or 2Fo-Fc maps. For these maps, 
the starting model had been refined to 2.2A resolution (R=22% Rf rcc =25%). Data were 
processed by the DHN/O program package and the electron density maps calculated by 
the XPFOR package. 

25 [electron density maps were inspected on a Silicon Graphics INDIG02 workstation 

using QCANTA 97. The shape of the density at the active site was visually identified by 
the shape of one or more of the compounds in the mixture to indicate a positive hit or by 
ordered water molecules indicating the absence of binding. For experiments which 
resulted in a positive hit. the appropriate compound was \isually moved into the electron 

30 density. 1 he electron densiu maps were also checked for am changes in the protein 
structure, and if observed, corresponding modifications were made in the structure. 



6308. rs. 1)1 



-36- 

Hence, alter the map inspeetion/compound-fitting step, the detailed 3-D structure of the 
eompound:protein complex was known. 

Two hits were detected in the I:nnC" adenosine analogue screen (compounds 44 
and 45). figures 13 and 14 show the crystal structure of the complexes of compounds 44 
5 and 45 with FrmC\ 




44 45 

In all cases, the electron density shape identified the binding compound. The 
10 hydrophobic substitution was found to bind along a partially exposed hydrophobic surface 
suggesting a preferred interaction w hich may have contributed to the binding of these 
compounds, allowing them to be pulled out as hits. No hits containing a substitution at the 
5"()H position were detected. A follow-up compound to compounds 44 and 45 contained 
an optimized indane substituent at this hydrophobic site. 



15 



